Cost-Based Learning for Planning

نویسندگان

  • Srinivas Nedunuri
  • William R. Cook
  • Douglas R. Smith
چکیده

Most learning in planners to date has been focused on speedup learning. Recently the focus has been more on learning to improve plan quality. We introduce a different dimension: learning not just from failed plans, but learning from inefficient plans. We call this cost-based learning (CAL). CBL can be used to improve both plan quality and provide speedup learning. We show how cost-based learning can also be used to learn plan rewrite rules that can be used to rewrite an inefficient plan to an efficient one, in the style of Planning by Rewriting (PbR). We do this by making use of dominance relations. Additionally, the learned rules are compact and do not rely on state information so they are fast to match.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A production-inventory model with permissible delay incorporating learning effect in random planning horizon using genetic algorithm

This paper presents a production-inventory model for deteriorating items with stock-dependent demand under inflation in a random planning horizon. The supplier offers the retailer fully permissible delay in payment. It is assumed that the time horizon of the business period is random in nature and follows exponential distribution with a known mean. Here learning effect is also introduced for th...

متن کامل

A Transformational Analysis of the Ebl Utility Problem in Soar

EEciency is a major concern for all planning systems. One way of achieving eeciency is the application of learning techniques to speed up planning. Accordingly, there has been considerable amount of research on applying EBL (explanation-based learning) techniques to planning. However, EBL is known to suuer from the utility problem, where the cost of using the learned knowledge overwhelms its be...

متن کامل

Model-Free Imitation Learning with Policy Optimization

In imitation learning, an agent learns how to behave in an environment with an unknown cost function by mimicking expert demonstrations. Existing imitation learning algorithms typically involve solving a sequence of planning or reinforcement learning problems. Such algorithms are therefore not directly applicable to large, high-dimensional environments, and their performance can significantly d...

متن کامل

ارائه الگوی مناسب بهای تمام‌شده در صنعت فرش (فرش دستباف)

Proper and rational collecting, classifying and regular reporting of financial data in a manufacturing unit requires establishing an appropriate and compiled cost accounting data system so that based on these reports, the managers of the manufacturing units can make their decisions for planning, control of production and also cost reduction. Since the hand-knitted carpet industry is a competito...

متن کامل

Rrt-hx: Rrt with Heuristic Extend Operations for Motion Planning in Robotic Systems

This paper presents a sampling-based method for path planning in robotic systems without known cost-to-go information. It uses trajectories generated from random search to heuristically learn the cost-to-go of regions within the configuration space. Gradually, the search is increasingly directed towards lower cost regions of the configuration space, thereby producing paths that converge towards...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011